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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GCAAAATTTG CTAAGATGAA GCGCCGGCTT GTTACATTGC TTTTCAGAGT CGATTGGTTC 6 0 

AAAATTGTCA ATTTTATCCA AAATAGAGTG CATTGTGTGT ACAATAACTA AAGAATCATC ' 12 0 

CAT AT C T GGT CCAACACAAC ATTGATGGAA TACTGGATCA ATTGTCTAAA AAAATATCAA 18 0 

TAGAATAATG AAACATTTTC AGAATTCATT ACCGTCAATG TCAGATAGTC ATTCCTTGAG 24 0 

TATTTTGTGG ATGCTTTGAA AATTCTTCGC TGGGCCATAT C TGTTGG AT A ATCTGAAAAA 3 00 

CGCAATAAAT TTCATCGAAA ATGCCTATTA AATTGAATTA CCTTCTTCTT CATCATTTCC 3 60 

TAACAATTCA TGCTCTTTTT GTGCTTGACT TGTGACCAAT TCTTTAAATT CAATTAAATC 42 0 

GTCAATATCC TTTTGTACTA AATCCATCTT GATATTCAAT ATATCTTTGT CAGTATAGTA 4 80 

TTCAGCGTAT CTGAAATTTC GAATTTATTT TTCTAATTCC CAAGAAAAAT AATTAATAAG 54 0 

AATACCTTAA CGAATTATTA TCCAATATAT CATCATTTGC CACATCTGGA AGACGCTGAG 60 0 

GAACTGTTTG AGCAGCTTGG AGGTAGTCGT CATCGTCTCT GGAAATTGTT ATTTTCAATT 66 0 

TCAAAAAAAA AACTTTACTT ACGAAATATA CTCATTTGAT GCAATCCACG GATCAAAACG 72 0 

ACGTCTTTGC ATCTTTGAAT CATTTTCCGC ATGGCACCGC ATCACTTCTT TCTTATGATT 78 0 

ATTTTCTAAC GTTTTTGAAA ATTCGACGTG CTCTTCACAA CGGCCGCCAT GTTTCGCAAG 84 0 

TTCTTCTTTT GATCGTATCT AAAATTTTAA ATTTGAAAAA AAGCTTACTA TCAAATTTTC 90 0 

GTATTTTTTC TCACCTGCTT ACACCGAACA AGCGTTCGAT ACGAAGCATA ATTACATTGT 96 0 

CCATACTTAT TTTTGTCGTA TTCATTGGCA ACAAGACGGA ATCGTGTTCC AGGTGCAACT 102 0 

ATATATTGAG CAGGAGGACG AGTTGTTTGT TTCATGCTGC TTAAAAATAA AAATGGAAAA 10 8 0 

TTGAGTCAAA AAGTTGAGAT AAAACAAATT AAAACAATTT TCTGAAAAAT AAACAACTGA 114 0 

AATTTGAAGT AATAAACAAC ACGCGAAAAC GTTATTTCGG AGCATCGTTT GAGAAGTAAA 12 00 

ACTTTTTTTC GGCGCACCCT TGTGCGCAGT TTTTATCTTC TCTTTTAATT TAATTTTCAA 12 60 

GCTAAATCTT TCTTTTTAAA CTTTGAATAA ATATTTAAAT ATTCAGAATG C AC C AAT AAA 13 2 0 

CCTGGAACAA AATCGATAAT GTTCCGCAAG CTTGGTTCTT CTGGGTCACT ATGGAAGCCG 13 8 0 

AAAAATCCGC ATTCTTTGGA ATACCTCAAA TATTTACAAG GAGTGCTCAC AAAAAATGAG 14 4 0 

AAAGTTACGG AAAACAATAA GAAAATATTA GTAGAAGCAT TACGAGCTAT CGCAGAAATT 15 0 0 

CTCATTTGGG GCGATCAGAA TGATGCTTCG GTTTTTGAGT GAGTTTTTTT CCAATGTTTT 15 60 

TTTTCAAATC TGATGTTGAA TTTCAGTTTC TTCCTTGAGC GGCAAATGCT TCTTTATTTC 162 0 

TTGAAAATTA TGGAACAAGG AAACACACCA CTAAATGTAC AATTACTGCA GACTTTGAAC 16 8 0 

ATTTTATTCG AAAATATTCG ACATGAAACT TCACTTTGTA AGTTTTTTAT ATGGATTTTC 174 0 

GCTTAAAATT GCCAGTTTTC AGATTTCCTT CTAAGTAACA ATCATGTAAA CTCGATTATT 18 00 

TCCCACAAAT TCGATTTACA AAATGATGAG ATCATGGCTT ACTACATTAG TTTTCTGAAA 18 60 

ACTCTTTCAT TTAAACTGAA TCCAGCTACA ATCCACTTCT TCTTCAATGA AACGACTGAA 192 0 

GAATTTCCAT TGTTGGTAGA AGTTTTGAAG CTTTATAATT GGAATGAATC AATGGTTCGA 198 0 

ATTGCTGTTA GAAATATTCT TTTAAATATT GTGAGAGTTC AAGATGATTC AATGATTATT 2 04 0 

TTCGCTATCA AGCATACAAA AGTTAGTAGA AAATTATTTT GAAAAGGTGT ATTTAAGCAA 210 0 

TAAATATTAC AGGAATATCT ATCGGAGTTA ATAGATTCTC TAGTTGGTCT CTCACTTGAA .2160 

ATGGACACAT TTGTACGATC TGCTGAGAAT GTGTTAGCTA ATCGAGAGAG ATTACGAGGA 222 0 

AAAGTGGATG ATTTAATTGA TTTGATTCAT TATATTGGTG AACTATTGGA TGTGGAAGCT 22 80 

GTCGCCGAAA GTTTATCAAT TTTAGGTCAG TTTTACTGCT GGAAAATCAA GTTTTTAATG 2 340 

TTAAATTTTC AGTAACAACA CGATACTTAA GCCCTCTATT ACTTTCAAGT ATATCACCAA 24 00 

GAAGAGATAA TCATTCACTT CTACTCACTC CGATTTCTGC GTTATTTTTT TTCTCTGAAT 24 6 0 

TTTTATTGGT GAGTTTTAAC ATTTAAAATT ACATTTTTCT AATTTATTTA TTTTTCAGAT 2 520 

AGTTCGTCAC CATGAAACAA TATATACATT TTTATCATCT TTCCTATTTG ACACTCAGAA 2 5 80 

TACTTTGACG AC C C ATTGG A TACGTCATAA TGAGAAATAT TGCTTAGAAC CGATTACATT 2 64 0 

ATCATCACCA ACCGGAGAAT ATGTGAATGA AGACCAGTAA GAGCTGAAAT TTTAAAATTT 2 700 

TTGCTTTGAA TATAGTATTT TCAGCGTATT TTTCGATTTT CTACTGGAAG CATTTGATTC 2 7 60 

CAGTCAAGCA GACGATTCGA AGGCATTCTA TGGATTAATG CTGATTTATT CAATGTTTCA 2 82 0 

GAATAATGGT GAGTTTTAAA AAATTG AT T T GTTAAATTAA AATTTCCATT TCCAATAACT 2 880 

CCTCTTCAGA CAGTAAGTTT TCAATGTTGT AAAGTTCCTG TTCATCTGTG ATCGTTTTCT . 2 94 0 

TCATTTTTTT AGTTTTGCAT GAACAGTTTT CAAATTTTTT TGATATCATA CAGTAAATAT 3 000 

CGTCATCCAG ATAATTTTCT ATTTAAAAAA AATGAATAAA AAGAGGGCGC GCAGAAATTG 3 0 60 

CCGAAGTAAT GTAAATTTAA AGGGACACAT GCGTAGCTTG TTGTGTGGGT CTCGCCGCGC 312 0 
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TTTGTTTGAT TTATCTTGTT TTCTGCTGAA AGAGCTGTTT TTATTTTAGC GTTGAATGCT 3180 

TTTTTACCGT TCTCATCGGC TTTTTAATAG GAATATTTAA AAAAAAAGGT TTAATAAATC 3240 

TTCGTTTTTA CAAAATCCAT CTAAGATTTG CATTTGTGAA GCTCAACAAG TAAAGTTTTA 33 00 

AGTAACATTG TTTTTTAAAA AACAATTGAA CCAAATTTTG CCGAAACATT AATAACATGA 3 3 60 

CGATACTCTA TAAAATATTC CTCTTTTCAA AATAAATTTT CAAAAAAAAT CCATTTTTCA 34 2 0 

GCCGATGTTG GAGAACTTCT ATCTGCTGCC AACTTCCCAG TGCTCAAAGA ATCAACGACA 34 80 

ACTTCATTAG CTCAACAGAA TCTTGCTCGT CTCCGAATAG CATCTACGTC TTCCATATCA 354 0 

AAG CGAACGA GAGCTATCAC TGAAATTGGA GTAGAAGCGA CCGAGGAAGA TGAGATTTTT 3600 

CATGATGTTC CTGAAGAACA AACGTTGGTA AGTAAATAAA TCAACATTGA TTGTTACACA 3 660 

AACTTTAATA TTTTTAAATT TGAAAATTTT CTTCAAAGTG CTCAAAAATC CTGTCGAAAA 372 0 

TTACAGGAAG ATCTGGTGGA TGATGTATTG GTTGATACTG AAAATTCAGC AATAAGTGAT 3 7 80 

CCAGAAGTGA GTAGAAAACG TGCATGTATT AATTATTAAA AAAAAAATAT AGTTTTCCCC 3 84 0 

AGTTTTCCTT GACCTAAAAC TCAGCAATTT CAGCCTAAAA ACGTGGAGTC AGAATCTCGT 3 90 0 
TCTCGATTTC AATCTGCTGT TGATGAGCTT CCACCTCCGT CGACTTCTGG ATGTGATGGT - 3 960 

CGACTTTTTG ATGCACTTTC ATCGATTATC AAAGCAGTTG GAACAGATGA CAATCGAATT 4 02 0 

CGACCAATTA CATTGGAACT TGCATGTCTT GTAATTCGGC AAATTTTAAT GACTGTTGAT 4 0 80 

GATGAAAAAG TAAGATTACA AATTCAAAAT TGAGCAAAAT CAGAATCTAA ATTTCATAAA 414 0 

TTGTTCAGGT ACATACCAGT TTAACGAAAT TATGCTTCGA AGTTCGTCTA AAACTTTTAT 42 00 

CATCAATTGG ACAATATGTT AATGGAGAGA ATCTGTTTTT GGAGTGGTTT GAGGATGAAT 4 2 60 

ATGCAGAATT TGAAGTAAGC CAAGAGGTCC GAAAATAATT TAATTCATCC TTTTTATTCA 43 2 0 

GGTGAAT C AC GTGAATTTCG ATATAATCGG TCACGAAATG CTTCTTCCTC CAGCTGCAAC 43 80 

TCCTCTTTCG AATCTGCTAC TTCATAAGCG ATTGCCCAGT GGATTTGAAG AACGAATAAG 444 0 

AACTGTAGGA AACTTTTTAA ATTTGAAAAT TAATTATATA TATATTTGCA GCAAATCGTA 4500 

TTCTACCTAC ATATTCGAAA ATTGGAACGA GATTTGACCG GTGAAGGAGA CACAGAATTA 4 5 60 

CCTGTGAGAG TGTTGAATTC TGATCAGGAA CCAGTTGCCA TCGGTGATTG TATTAATTTA 4 62 0 

CGTGAGTTCA TCTGCATAGA AAACACCATA TTTCTACTCA AATTAACAAT TTTCAGATAA 4680 

TTCGGATCTT CTATCCTGCA CTGTGGTTCC TCAACAACTA TGTTCTCTTG GAAAACCTGG 4 74 0 

TGATCGTCTT GCTCGATTCC TTGTCACTGA TAGACTTCAA TTAATTCTTG TCGAACCGGA 4 8 00 

TTCTCGAAAA GCCGGATGGG CAATTGTTCG ATTCGTAGGA CTTCTTCAAG ATACAACAAT 4860 

T AATG GAG AT TCTACGGATT CGAAAGTTTT GCATGTTGTG GTGGAAGGGC AACCCTCGAG 4 92 0 

AATTAAGGTA AGAATACTAA CGGGAAAAAA AAATCAAAAA ATTACTTCTG TTTCAGAAAA 4 980 

GACATCCGGT TTTAACTGCA AAGTTCATAT TCGATGATCA CATTCGGTGT ATGGCAGCAA 5 04 0 

AGCAACGGCT CACCAAGGTA ACGGAAAAAA , TAAC CAAAAA GACGGAAAGT TATTGTAAAT 510 0 

GGACGAAATC GGCGAAATTA ATTGAAAACG TTTGAATTTG CCGCTAAAAC CAAACGAAAA 5160 

C C AAACG AAA GCGAAATTTA ACTATCCCTT CAGGTAGAAT ATACATTTTA TTTCTCTTTA 52 2 0 

TAGGGTCGCC AAACAGCACG TGGTCTGAAA CTTCAGGCGA TATGTTCAGC TCTTGGAGTT 52 8 0 

CCACGTATCG ATCCAGCGAC AATGACGTCA TCACCACGAA TGAATCCATT CAGAATTGTG 5 34 0 

AAAGGATGCG CACCGGGAAG TGTACGAAAA ACTGTTTCCA CATCATCATC GTCAAGCCAA 54 0 0 

GGACGTCCCG GACATT ATT C TGCAAATCTT AGATCAGCAT CTAGAAATGC AGGAATGATA 54 60 
CCAGATGATC CAACTCAACC GAGTAGTTCT TCGGAAAGAA GATCCTAGGG ATCAATATCT - 5 52 0 

CTT CAGTTTC ATCATTTTAT GCTGTAAATT GTATTTAAGT ATTCCTATTC TTTGTAGTAC 558 0 

TGTATTTACA CATCGTCTAG TTAAAATCAC AAATCTCCGA AAAAACAAAC CAGTGAACAT 5 64 0 

GTGATATTTC TCTTGCCCAT AGTTCTCTTT TTTTTTTGAA ACAAAAACAA TTACTTTTAT 5700 

GCTCACCTAT TCGAGCCATA TTTTTTTCCC AATTACCGGT TGTTTATTTT AATTTCTTTT 5760 

TTTTTTCTGT AAATCTACTT TATTTTTAAA ACTG CATTTG AGATTGTGTA TATTTTTTCA 5 82 0 

AAATGGTTCA AATGCCGAAT CTATCTACTT TTTAATCATT ATTCAAACAG AAAAACCGAT 5 880 

TATTTATTCA GATTCTCAAA AATGGCTGAA AAAGCTGAAA ATCTTCCATC TTCTTCGGCC 5 94 0 

GAAGCTTCAG AAGAGCCATC ACCTCAAACT GGACCAAATG TGAATCAAAA ACCATCGATT 6000 

TTGGTTCTTG GAATGGCTGG TTCTGGAAAA AC G AC ATTTG TTCAGGTAAC TTTCATTCAA 6060 

TTTTGAGAGT TTTCAAACAT TACTATTTTC AGCGTCTCAC AGCATTCCTA CATGCTCGTA 612 0 

AAACACCTCC ATATGTGATT AATCTGGATC CGGCAGTTAG CAAAGTACCT TATCCAGTGA 618 0 

ATGTTGACAT TCGAGATACT GTGAAATACA AGGAAGTTAT GAAAGAATTC GGAATGGGAC 62 4 0 

CAAATGGAGC AATTATGACA TGTCTTAACC TGATGTGTAC TCGTTTTGAT AAAGTAATTG 63 0 0 

AGTTGATTAA TAAGAGATCT TCTGATTTCT CAGTTTGTCT TCTTG AT AC T CCTGGACAAA 63 6 0 

TTGAAGCATT CACTTGGAGT GCTAGTGGAT CTATTATCAC TGATTCATTG GCAAGTAGCC 64 2 0 

ATCCCACGGT AAGGGATTTT GATTTATGAA ATCTGCTTGA AATGAAAAAA GATTCTAATA 64 8 0 



4 



AATTTTTGAC TTTTAAACAT TTTTTACAGT TATATTTGGT CTATTTTCTA TCATTAAAAG 6540 

CAAAATGAAA AGTCGATTCT ACTCCATATT TATTAATTTC GACTTTTCAG GTGGTAATGT 6 600 

ACATTGTGGA TTCCGCTCGT GCCACAAATC CAACTACATT CATGTCCAAT ATGCTCTACG 6660 

CATGTTCCAT TCTCTACCGT ACCAAACTTC CATTCATTGT CGTTTTCAAC AAAGCTGATA 672 0 

TTGTCAAACC AACATTTGCA CTCAAATGGA TGCAAGATTT CGAAAGATTT GATGAAGCTT 67 80 

TAGAGGATGC CAGAAGCAGT TATATGAATG ATTTGAGTCG TTCATTGAGT CTCGTTCTTG 684 0 

ATGAATTCTA TTGCGGACTG AAAACAGGTT TTTATTCGAA ATAAAACCTT TTTTAAATAA 6 900 

TAAATTTCAG TTTGCGTCAG TTCTGCAACT GGAGAAGGAT TCGAAGATGT AATGACAGCA 6960 

ATCGATGAAA GTGTTGAAGC ATACAAAAAA GAATATGTTC CAATGTATGA AAAAGTGTTG 702 0 

GCTGAGAAAA AACTATTGGA TGAGGAGGAG AGAAAGAAAA GAGATGAAGA GGTAATTGTA 7080 

GTAATTTAAT TCTGATTATC TTCAAATTTT CAGACTCTGA AAGGAAAAGC TGTTCACGAC 714 0 

CTGAACAAAG TCGCCAATCC CGACGAATTT CTGGAGTCGG AGTTGAATTC AAAAATCGAT 72 00 

AGAATTCATT TGGGCGGAGT CGATGAAGAG AATGAGGAGG ATGCTGAACT CGAAAG AT C C 72 6 0 

TGATTTTCTT TTTGTTTTTG AATTTTTATT CTATTTTGAT CCCTGTTTAC TTCTTATTGT 73 2 0 

TCTCATTTTG TTGCGTTGTT TTACATTTTA CTCATTTTTG CATAAACTTG TTGCAAAAAT 73 80 

CAATATAATT TTTGATCTGG AAATGGTTTT AAACCTTAAC CTTTCATATA TTAATAATTT 744 0 

TTTTTCAAAA AAACGTTCTA AAAAGGTTCC TCATTTTTTC AATATAGGAA ATTTTGAAGA 7500 

TCTTTTCCAA AAATGAGGTT CTTCGCTTGA- AAAGCCAACA TTTAAAACCT TTTTTTTTCC 7560 

AGAAACCTAG TGGTTAATGT CTGAAAAGAC GTTCCACAAG G C AC AG AC C A TCCGTGCAAA 7620 

GGCATCCGGA GTGCCTTCAA TCGTCGAAGC TGTACAGTTT CATGGAGTTC G CAT C AC AAA 76 80 

AAACGATGCT TTGGTTAAGG AGGTACTACC CAAATTTCAA AATGTTGCAC AATTCAATTG 7 74 0 

AAAATATAAA TTGTGAATTA AATTCAACTT ACATGTTTTT TCAGGTTTCC GAATTATACA 7800 

GAAGTAAAAA TCTAGATGAA CTTGTTCATA ACTCTCATCT GGCGGCTCGT CATCTTCAAG 7 860 

AAGTTGGATT AATGGATAAT GCAGTTGCTC TAATTGATAC ATCTCCAAGC TCAAATGAAG 7 92 0 

GATATGTTGT CAATTTCCTA GTTCGAGAAC CAAAATCATT CACTGCTGGA GTCAAAGCAG 7 98 0 

GAGTTTCAAC GAATGGAGAT GCGGATGTCA GTTTAAATGC CGGAAAACAA AGTGTTGGAG 8 04 0 

GACGAGGAGA GGCAATCAAT ACACAGTATA CATATACTGT AAAGGTAAGG ACGAGAGTTG 8100 

GCACTGCCAG TTTGGCATGT TCTCCCAATA TTTTTTAATT ATAAAATTTG GAAGTATAAA 8160 

AAAATGTTTG CTTCATCTAA AAATAGCCTT TTTCACATGA AAAAAATTGA AAAAAAG TG C 822 0 

TCAAAAATTT CAGAAATTTC CAATTTCCAA ACAATTTTGG AGAACTTTCA AAAATTTTTC 82 80 

CAACTGAAAT TAAAGCTATA TTCTATCACT AAATTTTATA CAAGTCTTAA G AG AAAAT G A 8340 

TGAAGTGGCT CATTTTGTAG AATTTCCTAA AAAATAATAT CTTCAGGGCG ATCACTGCTT 84 00 

CAACATTTCC GCAATCAAAC CATTCCTGGG ATGGCAAAAA TATTCGAATG TAT C AG C G AC 84 60 

TCTATACCGT TCACTTGCAC ATATGCCATG GAATCAATCA GATGTTGATG AGAATGCAGC 852 0 

TGTTCTTGCA TATAATGGAC AACTATGGAA TCAAAAGCTT TTGCATCAAG TCAAATTGAA 85 80 

TGCGGTAAAG TATTATAAGT GTTTTGTCCA AACTATGATA CAGTTCTTCA GATATGGAGA 8 64 0 

ACACTTCGTG CCACTCGAGA TGCCGCATTT TCAGTTCGTG AACAAGCCGG ACACACTTTG 8700 

AAATTC T CGT TGGAGAATGC TGTAGCTGTT GATACAAGAG ATAGACCTAT TCTTGCAAGT 8760 

CGTGGAATTC TTGGTAAGAG TAACAACGAC TATTTTTAAA AAATATCTTT TTCGAAAAAA 8 82 0 
TTACGAACGA AAAAAAACTG TAT T ATGT AC CCAAACGCGA AATTTTGCAG TTCTTGCGCG ■ 88 80 

TTCTTGTTGA TAAAAAATAT GTAAAAAATT GGAAAAACTA CGAAAAGTCG ATAAAAATTC 8 94 0 

CGTACCAACC GGAAAATGTT T CAT T AATTT CTCTTCCTTT TTTCAGCTCG TTTTGCTCAA 9000 

GAGTACGCAG GAGTATTTGG TGATGCGTCA TTTGTGAAGA ATACATTAGA TTTACAGGTA 9060 

ACAACCTTAT TTCAACAATT ATTTCAAATT CTATTAAAAA TAATTCCAGG CAGCTGCCCC 912 0 

TCTTCCACTC GGTTTCATTC TTGCCGCCTC ATTCCAAGCG AAACATTTGA AAGG AC TCGG 9180 

AGATCGAGAA GTTCATATTT TGGATAGATG TTATTTGGGT GGACAACAGG ATGTTCGAGG 924 0 

ATTTGGTCTG AATACTATTG GAGTGAGTTT TAACGAAATT CTCTTGAAAG TCAAATAATC 93 0 0 

ATTTTCAGGT T AAAG C AG AT AACAGTTGTC TTGGAGGAGG TGCTTCACTT GCTGGTGTCG 93 60 

TTCATTTGTA TCGGCCATTG ATTCCACCAA ATATGCTATT TGCACACGCA TTCCTTGCAT 942 0 

CTGGAAGTGT TGCATCAGTT CATTCCAAAA ATTTGGTGCA ACAATTACAG GATACTCAAC 94 8 0 

GAGTATCAGC CGGATTTGGT GAGTTTGAAA TTTAGGAAAC ATTTGGATGA AATGTATTTT 954 0 

TTAAAAATAG ATCAGCTTTA TTTATTTGAA AAAAAACGCT CATTAATCAA TAGTGATAGT 9600 

TCCATTCTGA GTTTCTTCTT CTTCCTCGCG GAATACAATT TTTGACTTGT TCGCATCCTT 9660 

CTTGTGTACT TTGTCACCAA TCTTCTCATC AACTAAATCT CGAAACTGAA AAAATTTCAA 972 0 

AATTATTCCA AAAAATATTG ATGCAGACTA CCTTTTTGAT GGCTTCTGGT ACGTTTCTAG 97 8 0 

CGTCGAATGG ATTGGCTCCT CCAATAATTA AAGTCTCGTT CGGTAGTTTA GCCAGACGGA 984 0 
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CGGTGTGCTT CAACATTTTT CTAATTAATC TATTTCAATT CAAGTCACTC ACTCTCTCTT 990 0 

GACGTCTTCT TCTATATTCC AAGAACTCTG CAGAAAATCC GTGTCCGCCT TGTGTGTTTC 9 96 0 

TAGTTGGCGT CGGAGGATTC ACGGGTCCAA GACGAATGGA TGTCTAAAAA ATGTTATATT 10 020 

TTTGCATAAA GAAAACACCA TACCTTCACC ACTTTTTGAG TTGTGGGCGT TCTGAATGGA 10080 

ATTGATCGAT TATTATTGCT CTTTCTTGAT TTGCTTCTAT CAGCTGCGTA ATGAGGTGTT 1014 0 

CTAAAGATCA GCTTTAATTC ATTTGGACAA GTGCTCCTCT AATAAACTTA CCCTGTACTC 10200 

ATTTTTGAAA CGATTTACGA TGATAAGATT GAAAGTGGAA GTTAAATTTA GTCTTTCAAA 102 6 0 

GTTGAAATAA AATCTTCATA AATAAATAAA TTTAAATGAA AGATTAAATA AATTAACGTT 10320 

CACGTAGTTA AAAAAATAAT TTAAATCTTA ACTTCTAATA AAAAATCTCA ATTTTCCAGG 103 8 0 

ACTCGCATTC GTGTTCAAAA GTATTTTCCG GCTGGAACTC AACTACACGT ATCCATTGAA 1044 0 

ATATGTGCTC GGCGATTCAT TGCTCGGTGG ATTCCATATT GGAGCTGGTG TCAACTTCTT 10500 

GTAGAGATTA ATTGGATGCA AGCACCCCTC AAAAAGATTT TTTTGAAAAA CGATAAATTC 10560 

ACAGAATTTC AGTTCTTTTT CTCCCCCTTT TATTGTTATT TTCATCGTAA TGCTGTGCTA 10 620 

GAAGTCAGAG TAAATATGAG TTTTTTTGTG TTCTAGGAAT TCCATTTTTT CAGGAAGCAA 106 80 

ATTTAATAAA AATTATCGAA TTTCTTGCTC TAAAGATGTT GTACATTTTA TGGAAATGTT 107 4 0 

CGTATAGTAA TTCGAACACT TTATATTTCT CGTTTTAAAA CTGTCGGTGT TTTATAGTAA 10 8 00 

ACTATCTTCA GAAAAAAATG AGCCTACGAA AAATCAATTT CGTAACTGGA AACGTGAAGA 10 860 

AGCTTGAAGA AGTCAAGGCT ATTTTGAAGA ATTTCGAGGT AAAAT AT AT T TGATATTATT 10 920 

CGAACGCGAA ATTTTGCGCC AAAAGTACGA TGCCTGGTCT CAACACGACA ATATTTTGTT 10 980 

AAATACAAAC GAATGTGCGC CTTCAAAGAA AAGTTTCAAT CTTTCGTTGC CGTGGAGATA 1104 0 

TTTTTAGAGT TTTTGTTTAA ATTATATATT TGTCGTATCG AAACCGGGTA CCGTAATCAA 1110 0 

TCAATTAAAT ATTTTCAGGT TTCAAACGTG GATGTCGATT TGGATGAATT CCAAGGAGAA 11160 

CCCGAATTTA TTGCCGAAAG AAAGTGCCGT GAGGCTGTTG AAGCTGTAAA AGGGCCCGTT 112 2 0 

TTGGTATGGA AAATTGTATT TGTTCTAAAA ATTGTCAAAT TTCAGGTCGA AGACACAAGT 112 80 

TTATGCTTCA ACGCAATGGG CGGTCTTCCT GGACCTTATA TCAAGTGGTT TTTGAAGAAT 1134 0 

TTGAAACCAG AAGGACTACA TAATATGCTA GGTAAATATT TTAATTTTTT GAAAAAACTT 114 0 0 

ATTTTTCAGC CGGATTTTCT GACAAAACCG CCTATGCTCA ATGCATCTTT GCGTACACTG 114 60 

AAGGACTCGG AAAACCTATT CATGTATTTG CTGGTATGAT TTTTTGAATT TAATTCTTTA 1152 0 

ATTTTATATG TTAATTTAGT TGTTTCATTC CTCAATTTAT GAGAGATTTT TTTTTCAATT 115 8 0 

TTTCTATTTC AGGAAAATGT CCTGGTCAAA TTGTTGCTCC AC G T GGTG AT ACTGCTTTTG 1164 0 

GATGGGATCC ATGCTTCCAG C C AG ATGGTT TTAAAGAAAC ATTCGGAGAA ATGGATAAAG 117 00 

ATGTAAAAAA TGAAATTTCT CATCGTGCAA AGGCTCTGGA ACTCCTCAAG GAATATTTTC 117 60 

AGAATAATTA AATTATTTTT TCTCATCTAT GCAATTTCTT GAAAATTTGT TAAGTTTCCG 1182 0 

TTGTTATGCA TTTGCTTTTA TTTAAAAAAA AAAGAATATT T TT AC ATT AA TATTAGATAT 118 80 

GAGAAAAGAG TAATTTCTGG ATTTTAACCT TCCTACAAAA GAATATTTAT ATTTTTTGTA 11940 

TGATTTTTTA AAAATATCGT CAGGAAATAA TAACATTTCA GATATACCCT GAACTCTACA 12 0 00 

GTTTATGATA TTCAGGAAAT TTCTGAATTT TCTGAAACCT T AC AAAATG C GAACGGATCC 12 060 

GATTATTTTC GTGATTGGGT GCACTGGAAC CGGGAAAAGT GATCTTGGAG TGGCAATTGC 1212 0 

AAAGAAATAT GGAGGAGAGG TGATTAGTGT AGATTCAATG CAATTTTATA AAGGTACATG 12180 

GGTTTTGTTT CAATTTTAAA TTAATTAATT TTCGTTTTTC AGGACTTGAC ATTGCCACGA 12240 

ATAAGATAAC GGAAGAAGAA TCTGAAGGGA TTCAAC AT C A TATGATGTCA TTTTTGAATC 12 3 00 

CATCTGAATC AT CAT CTT AT AATGTACATA GTTTC CGAGA AGTCACGTTG GATCTTATTA 12 3 60 

AAGTGCTTAA TTCGCCACTT TTTGAACTTG ATCCTAATTT TCATAATTTT CAGAAAATCC 124 2 0 

GCGCCCGTTC AAAAATTCCT GTAATTGTCG GAGGAACCAC TTATTATGCT GAAAGTGTCC 124 80 

TTTATGAGAA TAATCTGATT GAAACCAACA CTTCAGATGA CGTGGATTCC AAATCGAGAA 12 54 0 

CATCATCAGA ATCGTCATCT GAAGACACTG AAGAAGGAAT TAGTAATCAA GAATTATGGG 12 600 

ATGAATTGAA AAAAATCGAC GAAAAATCAG CACTTCTTCT ACATCCAAAT AATCGTTATC 12 660 

GAGTACAGAG AGCATTGCAA ATTTTCAGAG AAACTGGTAA TTGATTTGCA AATTTCCAGA 12 72 0 

TTAAAAACAA ATCAAGTAAA GTTTTTTGCA GGAATCCGAA AAAGTGAACT TGTTGAAAAA 12 7 80 

CAGAAATCAG ATGAAACTGT TGATTTGGGT GG ACGACTAC GATTTGATAA TTCTTTAGTT 12 84 0 

ATTTTTATGG ATGCAACACC TGAAGTTTTA GAAGAAAGAC TTGATGGAAG AGTTGATAAA 12 900 

ATGATTAAAT TGGGTTTGAA GAATGAATTG ATCGAGTTTT ATAACGAGGT AAATATTTGA 12 960 

ATTTTTCCAG AAAAAAAAAG AAAATTTTTT ATTATTTTGT TTTTTTTTCA TTCTTTACTA 13 020 

TTTTCCAAAA AAGTTTAAAC TTTTGAAAAC TGTTCAGAAA ATGTTCGTGT ATTTATTTTA 13 0 80 

GCTTACTGAG GCATTATTTC ATTGTGATTT TTACTATACT CTATAAACTA AATTTTCAGC 1314 0 

ACGCCGAGTA CATAAATCAC AGCAAATATG GTGTCATGCA ATGTATTGGT CTTAAAGAAT 132 00 
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TCGTTCCATG GCTCAATTTG GACCCATCAG AAAGAGATAC ACTCAATGGG GATAAATTGT 13260 

TCAAGCAAGG GTAATTTAAA TTTATTTTCA ATTTTTATAA ATTCCAAGCT ATTTTCAGAT 13 32 0 

GCGATGATGT GAAGCTTCAC ACTCGACAAT ATGCACGGCG CCAGAGACGG TGGTATCGAT 13380 

CGAGACTTTT AAAACGGTCG GATGGTGATC GGGTATGTTG ATTTTAAAAA AATTGAATTT 1344 0 

TTAAAGAACT TTTTTACTAA ATTAACAAAG TTATTGGCTG AAAATGGCTG AAAATTATAG 13 5 00 

TAAAACTAAT CAAAAAAATT GAAATTTTGA ATTAAAGTCA TAAAGTGACG AC C AG AAAAT 13 560 

TAAAAAAAAA CATTTTTCTA TTTTAATTAA TTCACTCTAC TTCACTTTAA AAATAATTTT 13 62 0 

CAGAAAATGG CAAGTACAAA AATGCTGGAT ACATCTGACA AGT AC CGAAT AATTAGTGAT 13 6 80 

GGAATGGACA TTGTTGATCA ATGGATGAAT GGAATCGATC TATTTGAAGA TGTAAAATTT 13 74 0 

CACAAATTCT AAAATTTCCG AATCACAAAT TAAAATTTCT ACAGATCTCC ACAGACACCA 13 800 

ATCCAATTCT AAAAGGGTCC GATGCAAATA TTCTGCTGAA TTGTGAAATC TGTAATATTT 13 8 60 

CAATGACTGG AAAAGATAAT TGGTTTGTTT CAATACATAT TATAATTTCG AAATGAATTT 13 92 0 

TTTCAGGCAG AAACATATCG ATGGGAAAAA GCACAAGCAT CATGCTAAGC AAAAGAAATT 13 98 0 

GGCAGAGACT CGCACATAAG ACGCTATATT TATTTTTTGT TAACTTAAAT TATTTTTGTT 14 04 0 

GTTGATTGTT CTCTAAATAA AAAAACAGCT CAGAGAGAAG ATTAGGCGCT CGTCCACATC 14100 

TCCGACGATA GTCAACCCGA ACGAAGGGAA CTATCTTTAA TTGTCAGTGA TGACGTCATG 14160 

TCGTCAAGAA CTCGTCATAG CTGTGAGAAT TGAACCATTA TAGATTTGGA CATTAGTTTA 1422 0 

GGTTATATCC AGTACACTAA ATGGTACATG ATAGACAGTG TACATTTACA GATTTATAGA 142 80 

TTGTCTCAGT GACTAGTTAC CGGAAGAGGA GAGGAGAACA TGTGGCGATG TCTTTTGGAT 1434 0 

CGATATTATT CCGTCTGAAA ATTGTTCACT AGGGGGACTG CCGATTACCA CTTCACATGA 144 00 

CGGAACATGT TAGTT AAAAT ATTGGCTTTT ATACACATTT TCAAAATAGC ACCTGTAT 144 58 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met 


He 


Phe 


Arg 


Lys 


Phe 


Leu 


Asn 


Phe 


Leu 


Lys 


Pro 


Tyr 


Lys 


Met 


Arg 


1 








5 










10 










15 




Thr 


Asp 


Pro 


He 


He 


Phe 


Val 


He 


Gly 


Cys 


Thr 


Gly 


Thr 


Gly 


Lys 


Ser 








20 










25 










30 






Asp 


Leu 


Gly 


Val 


Ala 


He 


Ala 


Lys 


Lys 


Tyr 


Gly 


Gly 


Glu 


Val 


He 


Ser 






35 










40 










45 








Val 


Asp 


Ser 


Met 


Gin 


Phe 


Tyr 


Lys 


Gly 


Leu 


Asp 


He 


Ala 


Thr 


Asn 


Lys 




50 










55 










60 










lie 


Thr 


Glu 


Glu 


Glu 


Ser 


Glu 


Gly 


He 


Gin 


His 


His 


Met 


Met 


Ser 


Phe 


65 










70 










75 










80 


Leu 


Asn 


Pro 


Ser 


Glu 


Ser 


Ser 


Ser 


Tyr 


Asn 


Val 


His 


Ser 


Phe 


Arg 


Glu 










85 










90 










95 




Val 


Thr 


Leu 


Asp 


Leu 


He 


Lys 


Lys 


He 


Arg 


Ala 


Arg 


Ser 


Lys 


He 


Pro 








100 










105 










110 






Val 


He 


Val 


Gly 


Gly 


Thr 


Thr 


Tyr 


Tyr 


Ala 


Glu 


Ser 


Val 


Leu 


Tyr 


Glu 






115 










120 










125 








Asn 


Asn 


Leu 


He 


Glu 


Thr 


Asn 


Thr 


Ser 


Asp 


Asp 


Val 


Asp 


Ser 


Lys 


Ser 




130 










135 










140 










Arg 


Thr 


Ser 


Ser 


Glu 


Ser 


Ser 


Ser 


Glu 


Asp 


Thr 


Glu 


Glu 


Gly 


He 


Ser 


145 










150 










155 










160 


Asn 


Gin 


Glu 


Leu 


Trp 


Asp 


Glu 


Leu 


Lys 


Lys 


He 


Asp 


Glu 


Lys 


Ser 


Ala 



7 











lob 










17 0 










1 TC 
1 / O 




Leu. 


Leu 


Leu 


m s 


Pro 


Asn 


Asn 


Arg 


Tyr 


Arg 


vai 


Gin 


Arg 


ai a 


Leu 


Gin 


















1 oc 
10 J 
















lie 


V» i=k 
rile 


Arg 


GlU 


1 nr 


Giy 


lie 


Arg 


Lys 


Ser 


GlU 


Leu 


vai 


GlU 


Lys 


Gin 






-L J D 




















one 

Z U D 








Lys 


S6T 


Asp 


pi,, 
bill 


i. nr 


vai 


Asp 


Leu 


Giy 


Giy 


Arg 


Leu 


Arg 


File 


Asp 


Asn 




^ J. VJ 










^ 1 j 










Z 4S U 










Ser 


Leu 


vai 


lie 


Fne 


Met 


Asp 


Ala 


Thr 


Pro 


Glu 


vai 


Leu 


GlU 


GlU 


Arg 


one 
Z Z 3 










9 n 




















z *t u 


Leu 


Asp 


Gly 


Arg 


vai 


Asp 


Lys 


Met 


He 


Lys 


Leu 


Gly 


Leu 


Lys 


Asn 


GlU 










O A. C 
^ *k 3 




















ZOO 




Leu 


lie 


blU 


Frie 


Tyr 


Asn 


GlU 


His 


TV 1 -1 

Ala 


GlU 


Tyr 


Tl „ 

lie 


Asn 


rll S 


Ser 


Lys 








O G. Ct 










ore 
Z O D 










0 *7 n 






Tyr 


Giy 


vdl 


Met +- 


Gin 


Cys 


Tl n 

lie 


1 1 r 

biy 


Leu 


Lys 


/"•In 
GlU 


Fne 


vai 


Pro 


Trp 


Leu 






97c 










Ton 










O fl c; 
jC O 








Asn 


Leu 


Asp 


Piro 


Ser* 


ulU 


Arc* 


Asp 


inr 


Leu 


Asn 


Giy 


Asp 


Lys 


Leu 


Fne 














z J J 










inn 
j u "j 










Lys 


Gin 


Giy 


Cys 


Asp 


Asp 


vai 


Lys 


Leu 


ril S 


T'Vi -v- 

1 nr 


Arg 


Gin 


Tyr 


Aia 


Arg 


i n 










jlU 










TIC 
J 1 3 










.3 Z U 


Arg 


bin 


Arg 


Arg 


Trp 


Tyr 


Arg 


Ser 


Arg 


Leu 


Leu 


Lys 


Arg 


Ser 


Asp 


Gly 










J 










~\ 1 n 










lie 
J J J 




Asp 


Arg 


Lys 


wee 


Aia 


Ser 


Thr 


Lys 


Met 


Leu 


Asp 


Thr 


Ser 


Asp 


Lys 


Tyr 








J *± u 










J ft 3 










j DU 






Arg 


lie 


lie 


Ser 


Asp 


Gly 


Me t 


Asp 


lie 


vai 


Asp 


Gin 


Trp 


Met 


Asn 


Gly 






ICC 

J J 3 




















-5 O 3 








lie 


Asp 


Leu 


Pile 


Glu 


Asp 


He 


Ser 


Thr 


Asp 


Thr 


Asn 


Pro 


He 


Leu 


Lys 




370 










375 










380 










Gly 


Ser 


Asp 


Ala 


Asn 


He 


Leu 


Leu 


Asn 


Cys 


Glu 


He 


Cys 


Asn 


He 


Ser 


385 










390 










395 










400 


Met 


Thr 


Gly 


Lys 


Asp 


Asn 


Trp 


Gin 


Lys 


His 


He 


Asp 


Gly 


Lys 


Lys 


His 










405 










410 










415 




Lys 


His 


His 


Ala 


Lys 


Gin 


Lys 


Lys 


Leu 


Ala 


Glu 


Thr 


Arg 


Thr 












420 










425 










430 







(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 041 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTGCCATAAG ATGGCGTCCG TGGCGGCTGC ACGAGCAGTT CCTGTGGGCA GTGGGCTCAG 60 

GGGCCTGCAA CGGACCCTAC CTCTTGTAGT GATTCTCGGG GCCACGGGCA CCGGCAAATC 12 0 

CACGCTGGCG TTGCAGCTAG GCCAGCGGCT CGGCGGTGAG ATCGTCAGCG CTGACTCCAT 180 

GCAGGTCTAT GAAGGCCTAG ACATCATCAC CAACAAGGTT TCTGCCCAAG AGCAGAGAAT 24 0 

CTGCCGGCAC CACATGATCA GCTTTGTGGA TCCTCTTGTG ACCAATTACA CAGTGGTGGA 3 00 

CTTCAGAAAT AGAGCAACTG CTCTGATTGA AG AT AT AT TT GCCCGAGACA AAATTCCTAT 3 60 

TGTTG TGGG A GGAACCAATT ATTACATTGA ATCTCTGCTC TGGAAAGTTC TTGTCAATAC 42 0 

CAAGCCCCAG GAGATGGGCA CTGAGAAAGT GATTGAC CGA AAAGTGGAGC TTGAAAAGGA 4 80 

GGATGGTCTT GTACTTCACA AACGCCTAAG CCAGGTGGAC C C AG AAATGG CTGCCAAGCT 54 0 
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GCATCCACAT GACAAACGCA AAGTGGCCAG GAGCTTGCAA GTTTTTGAAG AAACAGGAAT 6 00 

CTCTCATAGT GAATTTCTCC ATCGTCAACA TACGGAAGAA GGTGGTGGTC CCCTTGGAGG 66 0 

TCCTCTGAAG TTCTCTAACC CTTGCATCCT TTGGCTTCAT GCTGACCAGG CAGTTCTAGA 720 
TGAGCGCTTG GATAAGAGGG TGGATGACAT GCTTGCTGCT GGGCTCTTGG AGGAACTAAG 7 80 

AGATTTTCAC AGACGCTATA ATCAGAAGAA TGTTTCGGAA AATAGC CAGG ACTATCAACA 840 
TGGTATCTTC CAATCAATTG GCTTCAAGGA ATTTCACGAG TACCTGATCA CTGAGGGAAA 900 
ATGCACACTG GAGACTAGTA ACCAGCTTCT AAAGAAAGGA CCTGGTCCCA TTGTCCCCCC 960 

TGTCTATGGC TTAGAGGTAT CTGATGTCTC GAAGTGGGAG GAGTCTGTTC TTGAACCTGC 102 0 

TCTTGAAATC GTGCAAAGTT TCATCCAGGG CCACAAGCCT ACAGCCACTC CAATAAAGAT 10 80 

GCCATACAAT GAAGCTGAGA ACAAGAGAAG TTATCACCTG TGTGACCTCT GTGATCGAAT 114 0 

CATCATTGGG GATCGCGAAT GGGCAGCGCA CATAAAATCC AAATCCCACT TGAACCAACT 12 00 

GAAGAAAAGA AGAAGATTGG ACTCAGATGC TGTCAACACC ATAGAAAGTC AGAGTGTTTC 12 60 

CCCAGACTAT AACAAAGAAC CTAAAGGGAA GGGATCCCCA GGGCAGAATG ATCAAGAGCT 13 2 0 

GAAATGCAGC GTTTAAGAGA CATGTCCAGT GGCCTTTGGA AAGGTGGTGG GGATCCAGTT 13 80 

CAGGAGGGAG GGGTATGTTT GTCTCCCAGT CTGGGCAAAG GAGTGCTATG CGGAATTCTC 14 4 0 

TGCATAGCAG AAAAGCTCCC ACCATTTTCT TTTGATGTGG TTTTAAAGTC TCACGTTCTC 15 0 0 

TATAATAGAA ACAGCAGGTC TTGTCAGCTC CTTGTGTGGC TGATGTGTCT GGAAATGATG 15 60 

TAGTTCAGGA AAGCATTTTT TTTTTCTTTG AACCTTAAAG GTTCTATTAT TAAAAGCAGC 162 0 

ACAGATTCCA CATTTTTATA CATGAGGATC TTCTTTGTGG TGAAT AC CAG GATTGACTGC 16 8 0 

ATCCCTTTAA AAGAAGTTTT ATGTCCCTGA CTCTGGCTAA AATTATCTAA TTTCCAGATG 17 4 0 

CTTTTGTAGA TGACTGAAGT ATTTGTGAGC CACATATTGG GAGTTCTAGA TTTGAGTGAA 18 0 0 

TGGCAGGAAA GGGCCATCTC CATTGAGATG ATTAAGTGAA CCAAACTAGT TCTCGGAATT 18 60 

CTACAGAGAA GGAGGGAATC AGACTGAGGA AGCTGTGACA TAGGACTTGA AGACCAAAGA 192 0 

CTTTGAAATT TGCGAGCTGC TCATGTGTGA GTTATTATCA CTGCTGTCTT TCTATTGAGT 19 80 

TACAAATCTA TATTTTTATT GAAGTTTAAA TAAAGAAAAA ATTTACAAGA AAAAAAAAAA 2 04 0 

A 2041 



(2) INFORMATION FOR SEQ ID NO : 4 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met 


Phe 


Arg 


Lys 


Leu 


Gly 


Ser 


Ser 


Gly 


Ser 


Leu 


Trp 


Lys 


Pro 


Lys 


Asn 


1 








5 










10 










15 




Pro 


His 


Ser 


Leu 


Glu 


Tyr 


Leu 


Lys 


Tyr 


Leu 


Gin 


Gly Val 


Leu 


Thr 


Lys 








20 










25 










30 






Asn 


Glu 


Lys 
35 


Val 


Thr 


Glu 


Asn 


Asn 
40 


Lys 


Lys 


He 


Leu 


Val 
45 


Glu 


Ala 


Leu 


Arg 


Ala 
50 


He 


Ala 


Glu 


He 


Leu 
55 


He 


Trp 


Gly 


Asp 


Gin 
60 


Asn 


Asp 


Ala 


Ser 


Val 


Phe 


Asp 


Phe 


Phe 


Leu 


Glu 


Arg 


Gin 


Met 


Leu 


Leu 


Tyr 


Phe 


Leu 


Lys 


65 










70 










75 










80 


He 


Met 


Glu 


Gin 


Gly 
85 


Asn 


Thr 


Pro 


Leu 


Asn 
90 


Val 


Gin 


Leu 


Leu 


Gin 
95 


Thr 


Leu 


Asn 


He 


Leu 
100 


Phe 


Glu 


Asn 


He 


Arg 
105 


His 


Glu 


Thr 


Ser 


Leu 
110 


Tyr 


Phe 


Leu 


Leu 


Ser 
115 


Asn 


Asn 


His 


Val 


Asn 
120 


Ser 


He 


He 


Ser 


His 
125 


Lys 


Phe 


Asp 
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Leu Gin Asn Asp 
130 

Leu Ser Phe Lys 
145 

Thr Thr Glu Glu 

Trp Asn Glu Ser 
180 

He Val Arg Val 
195 

Thr Lys Glu Tyr 
210 

Leu Glu Met Asp 
225 

Arg Glu Arg Leu 

Tyr He Gly Glu 
260 

He Leu Val Thr 
275 

Ser Pro Arg Arg 
290 

Leu Phe Phe Phe 
305 

He Tyr Thr Phe 

Thr Thr His Trp 
340 

Thr Leu Ser Ser 
355 

Phe Asp Phe Leu 
370 

Lys Ala Phe Tyr 
385 

Ala Asp Val Gly 

Glu Ser Thr Thr 
420 

He Ala Ser Thr 
435 

He Gly Val Glu 
450 

Glu Glu- Gin Thr 
465 

Glu Asn Ser Ala 

Ser Arg Ser Arg 
500 

Thr Ser Gly Cys 
515 

Lys Ala Val Gly 
530 

Leu Ala Cys Leu 
545 

Lys Val His Thr 



Glu He Met Ala 
135 

Leu Asn Pro Ala 
150 

Phe Pro Leu Leu 
165 

Met Val Arg He 

Gin Asp Asp Ser 
200 

Leu Ser Glu Leu 
215 

Thr Phe Val Arg 
230 

Arg Gly Lys Val 
245 

Leu Leu Asp Val 

Thr Arg Tyr Leu 
280 

Asp Asn His Ser 
295 

Ser Glu Phe Leu 
310 

Leu Ser Ser Phe 
325 

He Arg His Asn 

Pro Thr Gly Glu 
360 

Leu Glu Ala Phe 
375 

Gly Leu Met Leu 
390 

Glu Leu Leu Ser 
405 

Thr Ser Leu Ala 

Ser Ser He Ser 
440 

Ala Thr Glu Glu 
455 

Leu Glu Asp Leu 
470 

He Ser Asp Pro 
485 

Phe Gin Ser Ala 

Asp Gly Arg Leu 
520 

Thr Asp Asp Asn 
535 

Val He Arg Gin 
550 

Ser Leu Thr Lys 
565 



Tyr Tyr He Ser 
140 

Thr He His Phe 
155 

Val Glu Val Leu 
170 

Ala Val Arg Asn 
185 

Met lie lie Phe 

He Asp Ser Leu 
220 

Ser Ala Glu Asn 
235 

Asp Asp Leu He 
250 

Glu Ala Val Ala 
265 

Ser Pro Leu Leu 

Leu Leu Leu Thr 
300 

Leu He Val Arg 
315 

Leu Phe Asp Thr 
330 

Glu Lys Tyr Cys 
345 

Tyr Val Asn Glu 

Asp Ser Ser Gin 
380 

He Tyr Ser Met 
395 

Ala Ala Asn Phe 
410 

Gin Gin Asn Leu 
425 

Lys Arg Thr Arg 

Asp Glu He Phe 
460 

Val Asp Asp Val 
475 

Glu Pro Lys Asn 
490 

Val Asp Glu Leu 
505 

Phe Asp Ala Leu 

Arg He Arg Pro 
540 

He Leu Met Thr 
555 

Leu Cys Phe Glu 
570 



Phe Leu Lys Thr 

Phe Phe Asn Glu 
160 

Lys Leu Tyr Asn 
175 

He Leu Leu Asn 
190 

Ala He Lys His 
205 

Val Gly Leu Ser 

Val Leu Ala Asn 
240 

Asp Leu He His 
255 

Glu Ser Leu Ser 
270 

Leu Ser Ser lie 
285 

Pro He Ser Ala 

His His Glu Thr 
320 

Gin Asn Thr Leu 
335 

Leu Glu Pro lie 
350 

Asp His Val Phe 
365 

Ala Asp Asp Ser 

Phe Gin Asn Asn 
400 

Pro Val Leu Lys 
415 

Ala Arg Leu Arg 
430 

Ala He Thr Glu 
445 

His Asp Val Pro 

Leu val Asp Thr 
480 

Val Glu Ser Glu 
495 

Pro Pro Pro Ser 
510 

Ser Ser He He 
525 

lie Thr Leu Glu 

Val Asp Asp Glu 
560 

Val Arg Leu Lys 
575 



m 

10 



Leu 


Leu 


Ser 


Ser 


Tl fl 

lie 


(jiy 


p i _ 
uin 


Tyr 


vai 


Asn 


Gly Glu 


Asn 


Leu 


DV10 
rue 


Leu 








con 

jOU 










CQC 
30D 










can 

ZD ^ \J 






G1U 


Trp 


nU — 
rne 


pi .1 
bill 


Asp 


pi .. 

tjlU 


Tyr 


Aia 


ulU 


File 


p 1 11 


Va 1 
v a -L 


Asn 


m s 


Val 


Asn 






CQC 
J? J 










fi n n 

O U v 










605 








Prie 


Asp 


lie 


Tl a 

lie 


(jiy 


rll S 


pi ,, 

ulU 


fiec 


Leu 


Leu 


Pro 


Ota 

It 1 (J 


Ala 


aia 

Ala 


TVi r* 
1 nr 


Pro 




o J. U 










file: 

O X J 










D ^ U 










Leu 


Ser 


Asn 


Leu 


Leu 


Leu 


rll S 


Lys 


Arg 


Leu 


Pro 


Ser 


Pl Tf 

uiy 




ai ii 

ulU 


P 1 

UlU 


G. 7 EI 










OJU 










ojD 










D *± \J 


Arg 


i le 


Arg 


inr 




Tl a 
lie 


Vo.1 


rile 


Tyr 


Leu 


XIX O 


Tip 


Arg 


Lys 


Leu 


m n 

UlU 










64 5 










650 










655 




Arg 


Asp 


Leu 


i nr 






P 1 -ir 

uiy 


Asp 


1 nr 


Pi ,1 
blU 


Leu 




vai 


a m 
Ax y 


vai 


LlCU 








o o u 










D D O 










670 






Asn 


Ser 


Asp 


Pl n 

Lain 


LllU 


Pro 


vai 


A 1 -1 

Aid 


Tie 

lie 


biy 


Asp 


Cys 


116 


As n 


Leu 


nis 






^ 7 cr 










0 0 u 










O O J 








Asn 


Ser 


Asp 


Leu 


Leu 


Ser 


Cys 


1 nr 


vai 


u a 1 

vai 


Pro 


p 1 _ 
uin 


pi n 

bin 


Leu 


Cys 


Ser 




con 
O _? U 










roc 










*7 a n 










Leu 


pi,, 

\j±y 


Lys 


Pro 


vjiy 


Asp 


Arg 


Leu 


a 1 a 
Ala 


Arg 


rile 


Leu 


Val 


1 nr 


Asp 


Arg 


7 rm 




















*7 1 C 

/Id 










72 0 


Leu 


f 1 n 

(jin 


Leu 


lie 


Leu 


v ai 


pi .. 

bill 


Pro 


Asp 


Ser 


Arg 


Lys 


aia 
Ala. 


Pl T r 

uiy 


Trp 


Al a 










79 R 










73 0 










7 3 5 




Tic* 

i le 


v a. 1 


Arg 


fne 


Val 


uiy 


Leu 


Leu 


P 1 « 

uin 


Asp 


X 111 


X 111 


Tip 
lie 


As n 


p,l v 

biy 


Asp 








1 ACS 










/ *± o 










7 S 0 
t zj \j 






Ser 


i nr 


Asp 


Ser 


Lys 


val 


Leu 


y-! c 

rll s 


ya 1 
Val 


val 


vai 


ulU 


ai w 

biy 


Pl n 
Olil 


"D yp 


Q 0 v 
0 c X 






7 c 










7 c n 










7 £ ^ 








Arg 


lie 


Lys 


Lys 


Arg 


rll S 


Pro 


vai 


Leu 


Tnr 


al a 
ax a 


T ,t/c 
j-iy 0 


DV10 
rile 


lie 




Asp 




7 7 n 




















/OU 










Asp 


rll S 


ne 


Arg 


Cy s 


rlcL 


Aid 


ai a 


Lys 


P 1 

uin 


Arg 


Leu 


1 nr 


Lys 


pl v 

uiy 


Arg 


7 £ c; 










7 q n 










795 










ft n 0 

0 u u 




i nr 


Ala 
Al a. 


Arg 


uiy 


Leu 


Lys 


Leu 


uxn 


Ala 


He 


cys 


Q v~ 
OCX 


Al a 


Leu 


p 1 w 
vjx y 










OU J 










810 










O X ZD 




Vdl 


Pro 


Arg 


ne 


Asp 


Pro 


Ala 


1 nr 




Thr 


Ser 


Cpv- 
OC X 


Pro 


Arg 




As n 








82 0 










82 5 










83 0 






Pro 


Phe 


Arg 


He 


Val 


Lys 


Gly 


Cys 


Ala 


Pro 


Gly 


Ser 


Val 


Arg 


Lys 


Thr 






835 










840 










845 








Val 


Ser 


Thr 


Ser 


Ser 


Ser 


Ser 


Ser 


Gin 


Gly Arg 


Pro 


Gly 


His 


Tyr 


Ser 




850 










855 










860 










Ala 


Asn 


Leu 


Arg 


Ser 


Ala 


Ser 


Arg 


Asn 


Ala 


Gly 


Met 


He 


Pro 


Asp 


Asp 


865 










870 










875 










880 


Pro 


Thr 


Gin 


Pro 


Ser 


Ser 


Ser 


Ser 


Glu 


Arg 


Arg 


Ser 











885 890 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 5 : 

Met Ala Glu Lys Ala Glu Asn Leu Pro Ser Ser Ser Ala Glu Ala Ser 
15 10 15 
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Glu 


Glu 


Pro 


Ser 
20 


Pro 


Gin 


Thr 


Gly 


Pro 
25 


Asn 


Val 


Asn 


Gin 


Lys 
30 


Pro 


Ser 


lie 


Leu 


Val 
35 


Leu 


Gly 


Met 


Ala 


Gly 
40 


Ser 


Gly 


Lys 


Thr 


Thr 
45 


Pne 


vai 


bin 


Arg 


Leu 
50 


Thr 


Ala 


Phe 


Leu 


His 
55 


Ala 


Arg 


Lys 


Thr 


Pro 
60 


Pro 


Tyr 


val 


x xe 


Asn 


Leu 


Asp 


Pro 


Ala 


Val 


Ser 


Lys 


Val 


Pro 


Tyr 


Pro 


vai 


Asn 


vax 


Asp 


65 










70 










75 










80 


lie 


Arg 


Asp 


Thr 


Val 
85 


Lys 


Tyr 


Lys 


Glu 


Val 

90 


Met 


Lys 


GXU 


rfle 




Met 


Gly 


Pro 


Asn 


Gly 
100 


Ala 


lie 


Met 


Thr 


Cys 
105 


Leu 


Asn 


Leu 


Met 


Cys 
110 


inr 


Arg 


Phe 


Asp 


Lys 
115 


Val 


lie 


Glu 


Leu 


lie 
120 


Asn 


Lys 


Arg 


Ser 


Ser 
125 


Asp 


pne 


Ser 


Val 


Cys 
130 


Leu 


Leu 


Asp 


Thr 


Pro 
135 


Gly 


Gin 


lie 


Glu 


Ala 
140 


Phe 


inr 


Trp 


Ser 


Ala 


Ser 


Gly 


Ser 


lie 


lie 


Thr 


Asp 


Ser 


Leu 


Ala 


Ser 


Ser 


TJ t _ 

HI S 


Pro 


l nr 


145 










150 










155 










160 


Val 


Val 


Met 


Tyr 


lie 
16 5 


val 


Asp 


Ser 


Ala 


Arg 

1 / U 


Ala 

ai a 


J-p V> -v- 

inr 


Asn 


Pro 


i nr 

X / D 


i nr 


Phe 


Met 


Ser 


Asn 
180 


Met 


Leu 


Tyr 


Ala 


Cys 
185 


Ser 


lie 


Leu 


Tyr 


Arg 
190 


Thr 


Lys 


Leu 


Pro 


Phe 
195 


lie 


Val 


Val 


Phe 


Asn 
200 


Lys 


Ala 


Asp 


lie 


Val 
205 


Lys 


Pro 


Thr 


Phe 


Ala 
210 


Leu 


Lys 


Trp 


Met 


Gin 
215 


Asp 


Phe 


Glu 


Arg 


Phe 
220 


Asp 


Glu 


A 1 -3 

Ala 


Leu 


Glu 


Asp 


Ala 


Arg 


Ser 


Ser 


Tyr 


Met 


Asn 


Asp 


Leu 


Ser 


Arg 


Ser 


Leu 


Ser 


225 










230 










235 










240 


Leu 


Val 


Leu 


Asp 


Glu 
2 4 5 


Pne 


Tyr 


Cys 


Gly 


Leu 


Lys 


Thr 


vax 


Cys 


vax 

ZOO 


Ser 


Ser 


Ala 


Thr 


Gly 
260 


Glu 


Gly 


Pne 


Glu 


Asp 
265 


val 


Met 


Thr 


Ala 

Axa 


x xe 
270 


Asp 


GXU 


Ser 


Val 


Glu 
275 


Ala 


Tyr 


Lys 


Lys 


Glu 
280 


Tyr 


val 


Pro 


Met 


Tyr 
285 


G1U 


Lys 


vax 


Leu 


Ala 
290 


Glu 


Lys 


Lys 


Leu 


Leu 
295 


Asp 


Glu 


Glu 


Glu 


Arg 
300 


Lys 


Lys 


Arg 


Asp 


Glu 


Glu 


Thr 


Leu 


Lys 


Gly 


Lys 


Ala 


Val 


His 


Asp 


Leu 


Asn 


Lys 


Val 


Ala 


305 










310 










315 










320 


Asn 


Pro 


Asp 


Glu 


Phe 
325 


Leu 


Glu 


Ser 


Glu 


Leu 
330 


Asn 


Ser 


Lys 


He 


Asp 
335 


Arg 


lie 


His 


Leu 


Gly 
340 


Gly 


Val 


Asp 


Glu 


Glu 
345 


Asn 


Glu 


Glu 


Asp 


Ala 
350 


Glu 


Leu 



Glu Arg Ser 
355 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Ser Glu Lys Thr Phe His Lys Ala Gin Thr lie Arg Ala Lys Ala 

15 10 15 

Ser Gly Val Pro Ser lie Val Glu Ala Val Gin Phe His Gly Val Arg 

20 25 30 

lie Thr Lys Asn Asp Ala Leu Val Lys Glu Val Ser Glu Leu Tyr Arg 

35 40 45 

Ser Lys Asn Leu Asp Glu Leu Val His Asn Ser His Leu Ala Ala Arg 

50 55 60 

His Leu Gin Glu Val Gly Leu Met Asp Asn Ala Val Ala Leu lie Asp 
65 70 75 80 

Thr Ser Pro Ser Ser Asn Glu Gly Tyr Val Val Asn Phe Leu Val Arg 

85 90 95 

Glu Pro Lys Ser Phe Thr Ala Gly Val Lys Ala Gly Val Ser Thr Asn 

100 105 110 

Gly Asp Ala Asp Val Ser Leu Asn Ala Gly Lys Gin Ser Val Gly Gly 

115 120 125 

Arg Gly Glu Ala lie Asn Thr Gin Tyr Thr Tyr Thr Val Lys Gly Asp 

130 135 140 

His Cys Phe Asn lie Ser Ala lie Lys Pro Phe Leu Gly Trp Gin Lys 
145 150 155 160 

Tyr Ser Asn Val Ser Ala Thr Leu Tyr Arg Ser Leu Ala His Met Pro 

165 170 175 

Trp Asn Gin Ser Asp Val Asp Glu Asn Ala Ala Val Leu Ala Tyr Asn 

180 185 190 

Gly Gin Leu Trp Asn Gin Lys Leu Leu His Gin Val Lys Leu Asn Ala 

195 200 205 

lie Trp Arg Thr Leu Arg Ala Thr Arg Asp Ala Ala Phe Ser Val Arg 

210 215 220 

Glu Gin Ala Gly His Thr Leu Lys Phe Ser Leu Glu Asn Ala Val Ala 
225 230 235 240 

Val Asp Thr Arg Asp Arg Pro lie Leu Ala Ser Arg Gly lie Leu Ala 

245 250 255 

Arg Phe Ala Gin Glu Tyr Ala Gly Val Phe Gly Asp Ala Ser Phe Val 

260 265 270 

Lys Asn Thr Leu Asp Leu Gin Ala Ala Ala Pro Leu Pro Leu Gly Phe 

275 280 285 

lie Leu Ala Ala Ser Phe Gin Ala Lys His Leu Lys Gly Leu Gly Asp 

290 295 300 

Arg Glu Val His lie Leu Asp Arg Cys Tyr Leu Gly Gly Gin Gin Asp 
305 310 315 320 

Val Arg Gly Phe Gly Leu Asn Thr lie Gly Val Lys Ala Asp Asn Ser 

325 330 335 

Cys Leu Gly Gly Gly Ala Ser Leu Ala Gly Val Val His Leu Tyr Arg 

340 345 350 

Pro Leu lie Pro Pro Asn Met Leu Phe Ala His Ala Phe Leu Ala Ser 

355 360 365 

Gly Ser Val Ala Ser Val His Ser Lys Asn Leu Val Gin Gin Leu Gin 

370 375 380 

Asp Thr Gin Arg Val Ser Ala Gly Phe Gly Leu Ala Phe Val Phe Lys 
385 390 395 400 

Ser lie Phe Arg Leu Glu Leu Asn Tyr Thr Tyr Pro Leu Lys Tyr Val 

405 410 415 

Leu Gly Asp Ser Leu Leu Gly Gly Phe His lie Gly Ala Gly Val Asn 
420 425 430 



13 



Phe Leu 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



Met 


Leu 


Tyr 


lie 


Leu 


Trp 


Lys 


Leu 


Asn 


Tyr 


Leu 


Gin 


Lys 


Lys 


Met 


Ser 


1 








5 










10 










15 




Leu 


Arg 


Lys 


lie 


Asn 


Phe 


Val 


Thr 


Gly 


Asn 


Val 


Lys 


Lys 


Leu 


J.U 


ulU 








20 










25 










30 






Val 


Lys 


Ala 


He 


Leu 


Lys 


Asn 


Phe 


Glu 


Val 


Ser 


Asn 


Val 


Asp 


Val 


Asp 






35 










40 










45 








Leu 


Asp 


Glu 


Phe 


Gin 


Gly 


Glu 


Pro 


Glu 


Phe 


He 


Ala 


Glu 


Arg 


Lys 


Cys 




50 










55 










60 










Arg 


Glu 


Ala 


Val 


Glu 


Ala 


Val 


Lys 


Gly 


Pro 


Val 


Leu 


Val 


Glu 


Asp 


Thr 


65 










70 










75 










80 


Ser 


Leu 


Cys 


Phe 


Asn 


Ala 


Met 


Gly 


Gly 


Leu 


Pro 


Gly 


Pro 


Tyr 


He 


Lys 










85 










90 










95 




Trp 


Phe 


Leu 


Lys 


Asn 


Leu 


Lys 


Pro 


Glu 


Gly 


Leu 


His 


Asn 


Met 


Leu 


Ala 








100 










105 










110 






Gly 


Phe 


Ser 


Asp 


Lys 


Thr 


Ala 


Tyr 


Ala 


Gin 


Cys 


He 


Phe 


Ala 


Tyr 


Thr 






115 










120 










125 








Glu 


Gly 


Leu 


Gly 


Lys 


Pro 


He 


His 


Val 


Phe 


Ala 


Gly 


Lys 


Cys 


Pro 


Gly 




130 










135 










140 










Gin 


lie 


Val 


Ala 


Pro 


Arg 


Gly 


Asp 


Thr 


Ala 


Phe 


Gly 


Trp 


Asp 


Pro 


Cys 


145 










150 










155 










160 


Phe 


Gin 


Pro 


Asp 


Gly 


Phe 


Lys 


Glu 


Thr 


Phe 


Gly 


Glu 


Met 


Asp 


Lys 


Asp 










165 










170 










175 




Val 


Lys 


Asn 


Glu 


He 


Ser 


His 


Arg 


Ala 


Lys 


Ala 


Leu 


Glu 


Leu 


Leu 


Lys 








180 










185 










190 






Glu 


Tyr 


Phe 


Gin 


Asn 


Asn 























195 



(2) INFORMATION FOR SEQ ID NO : 8 : 

\ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



CGAACACTTT ATATTTCTCG 
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(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GATAGTTCCC TTCGTTCGGG 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
TTTCTGGATT TTAACCTTCC 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
TTTCCGAGAA GTCACGTTGG 



(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 
TACAGGAATT TTTGAACGGG 



15 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CTTCAGATGA CGTGGATTCC 



20 



GGAATCCGAA AAAGTGAACT 



20 



AAGAGATACA CTCAATGGGG 



20 



ATCGATACCA CCGTCTCTGG 



20 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTGAATCTAC ACTAATCACC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCAATTATCT TTTCCAGTCA 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ACATTATAAA GTTACTGTCC 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTTTAGTTAA AGCATTGACC 
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(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 
ACATCTTTAT CCATTTCTCC 2 0 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TGCAAAGGCT CTGGAACTCC 2 0 



(2) INFORMATION FOR SEQ ID NO : 2 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
AAAAACCACT TGATATAAGG 2 0 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



CATCCAAAAG CAGTATCACC 



20 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



(2) INFORMATION FOR SEQ ID NO : 2 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



TTAATTGGAT GCAAGCACCC C 
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ATTACTATAC GAACATTTCC 



20 



TTGTAAAGGC GTTAGTTTGG 



20 



CAGGAGTATT TGGTGATGCG 



20 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CGACGGGGAG AAGGTG AC GG 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AAAACTTCTA CCAACAATGG 



(2) INFORMATION FOR SEQ ID NO : 3 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CGTAATCTCT CTCGATTAGC 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CCGTGGGATG GCTACTTGCC 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TGGATTTGTG GCACGAGCGG 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TTGATTGCCT CTCCTCGTCC 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
ATCAACATCT GATTGATTCC 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CAGCGAGCGC ATGCAACTAT ATATTG AG C A GG 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
AATAAATATT TAAATATTCA GATATACCCT GAACTCTACA G 



(2) INFORMATION FOR SEQ ID NO : 3 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
AAACTGTAGA GTTCAGGGTA TATCTGAATA TTTAAATATT TATTC 



(2) INFORMATION FOR SEQ ID NO : 3 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GTACGTGGAG CTCTGCAACT ATATATTGAG CAGG 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
ATGACACTGC AGGATAGTTC CCTTCGTTCG GG 
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(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 
GTGTTGCATC AGTTCATTCC 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 
GCTGTGCTAG AAGTCAGAGG 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 
GTTCTCCTTG GAATTCATCC 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 
AGTATATCTA GATGTGCGAG TCTCTGCCAA TT 
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(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGTAATTGTA CATTTAGTGG 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
ATTAACCTTA CTTACTTACC 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO. 
CTAAACTAAG TAATATAACC 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



GTTGATTCTT TGAGCACTGG 
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(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
AATTCGACCA ATTACATTGG 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AACATAGTTG TTGAGGAAGG 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
AATTAATGGA GATTCTACGG 



(2) INFORMATION FOR SEQ ID NO : 52 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) 'TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



TCAGCATCTA GAAATGCAGG 
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(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
CGAATGTCAA CATTCACTGG 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
CTTAACCTGA TGTGTACTCG 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
ATGAAGCTTT AGAGGATGCC 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 
CGACGAATTT CTGGAGTCGG 




(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AC TG CAT TAT CCATTAATCC 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
.(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CACCCAAATA ACATCTATCC 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TTTAACCTCA TCTTCGCTGG 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 



ATGTTCCGCA AGCTTGGTTC 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
TTTAATTACC CAAGTTTGAG 



(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 62 : 
TTTTAACCCA GTTACTCAAG 



